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A 10- kb region of the Bacillus subtilis genome that contains genes involved in biotin biosynthesis was cloned 
and sequenced. DNA sequence analysis indicated that B. subtilis contains homologs of the Escherichia coli and 
Bacillus sphaericus bioA, bioB, bioD, and bioF genes. These four genes and a homolog of the B. sphaericus bioW 
gene are arranged in a single operon in the order bioWAFDB and are followed by two additional genes, biol and 
or/2, biol and or/2 show no similarity to any other known biotin biosynthetic genes. The biol gene encodes a 
protein with similarity to cytochrome P-450s and was able to complement mutations in either bioC or bioH of 
E. coli. Mutations in biol caused B. subtilis to grow poorly in the absence of biotin. The bradytroph phenotype 
of biol mutants was overcome by pimelic acid, suggesting that the product of biol functions at a step prior to 
pimelic acid synthesis. The B. subtilis bio operon is preceded by a putative vegetative promoter sequence and 
contains just downstream a region of dyad symmetry with homology to the bio regulatory region of B. 
sphaericus. Analysis of a bioW-lacZ translational fusion indicated that expression of the biotin operon is 
regulated by biotin and the B. subtilis birA gene. 



Biotin biosynthesis in Escherichia coli and Bacillus sphaericus 
has been studied extensively at both the biochemical and mo- 
lecular biological levels (9, 14, 17, 29). The enzymes involved in 
the conversion of pimeloyl coenzyme A (CoA) to biotin have 
been isolated from both of these bacterial species and charac- 
terized (2, 14, 16, 23, 29, 42). The analogous pairs of enzymes 
from the two species are similar, although some of the com- 
ponents involved in the last step in biotin synthesis remain to 
be elucidated (6, 15, 25, 26, 37, 46). 8-Amino-7-kctopelargonic 
acid (KAPA) synthase, the product of bioF, catalyzes the con- 
version of pimeloyl-CoA and alanine to KAPA (Fig. 1). 7,8- 
Diaminopelargonic acid (DAP A) aminotransferase, the prod- 
uct of bioA, then uses S-adenosylmethionine as a donor to 
transfer an amino group to KAPA, yielding DAP A. Dethiobi- 
otin (DTB) synthetase (bioD) catalyzes the closure of the ure- 
ido ring to produce DTB, and finally the product of bioB, 
biotin synthase, functions together with a number of other 
components, including flavodoxin (6, 26), S-adenosylmethio- 
nine (6, 15, 25, 37, 46), and possibly cysteine (6, 15, 47), to 
convert DTB to biotin. 

In E. coli the genes that encode these enzymes are located in 
two divergently transcribed operons, controlled by a single 
operator that interacts with the BirA repressor (1, 9). In B. 
sphaericus, the genes are located in two separate, unlinked 
operons (17). The early steps of the pathway, those involved in 
the synthesis of pimeloyl-CoA, are less well understood (27, 
48). B. sphaericus contains an enzyme, pimeloyl-CoA syn- 
thetase (bio W), that converts pimelic acid to pimeloyl-CoA (17, 
43). E. coli lacks this enzyme and cannot use pimelic acid as an 
intermediate in biotin synthesis (17, 27, 48). E. coli contains 
two genes, bioC, which is located in the bio operon, and bioH, 
which is unlinked to the other bio genes, that appear to be 
involved in the early steps of biotin biosynthesis leading up to 
pimeloyl-CoA, but their exact roles are unknown (14, 32). 
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Although there are no obvious homologs olbioC or bioH in the 
two sequenced bio operons of B. sphaericus, Lemoine et al. 
(32) have suggested that both the BioC protein of E. coli and 
the BioX protein of B. sphaericus may function as acyl carrier 
proteins involved in pimeloyl-CoA synthesis. Like most acyl 
carrier proteins, BioX possesses a consensus sequence for a 
phosphopantetheine attachment site. BioC does not possess 
such an attachment site; however, Lemoine ct a!. (32) pro- 
posed that BioC functions in a way similar to that of chalcone 
synthase, an enzyme which does not require the 4'-phospho- 
pantetheine group. They have also identified a consensus se- 
quence in BioH protein which is characteristic of acyltrans- 
ferase and thioeslerasc proteins. 

Prior to this work, little was known about the biotin biosyn- 
thetic genes in Bacillus subtilis. Pai (40) had isolated a collec- 
tion of biotin auxotrophs and shown that they all map at the 
same locus on the chromosome (262°) and are weakly linked to 
aroG by transformation. On the basis of nutritional require- 
ments and excreted products, the mutants could be divided 
into three classes that appeared to correspond to E. coli mu- 
tations in bioB, bioA, and bioF (17, 40). Here we report that the 
bio genes of B. subtilis are located in a single operon and that 
genes with similarity to bioW, bioA, bioF, bioD, and bioB are 
found in this operon. In addition, the B. subtilis operon con- 
tains two other genes that correspond to no other known bio 
genes. 



MATERIALS AND METHODS 

Bacterial strains, plasmids, and growth conditions. Bacterial strains used in 
this study arc listed in Tabic 1. Plasmids pUC9 (57), pUC19 (61), pCL1921 (33). 
pJGP44, and pTR264 (31) were used for cloning into E. coli. pJGP44 is a 
derivative of pBR322 that contains an 82-bp polylinkcr with multiple restriction 
sites inserted between the filled EcoKl site and the Mwl site of pBR322 (6a). 
E. coli strains were grown on Luria-Bertani medium without glucose. Competent 
E. coli was prepared by the method of Inoue (28) or purchased from Bethesda 
Research Laboratories, Inc. E. coli cells transformed by electropo ration were 
prepared, stored, and transformed as described by Dower et al. (12). B. subtilis 
cells were grown on Tryptosc Blood Agar Base (Difco) plates or in veal infusion 
broth-yeast extract (VY) broth (7). Competent B. subtilis was prepared, stored, 
and transformed as described by Dubnau and DavidofF-Abelson (13). Plasmid 
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FIG. 1. Biotin biosynihcsis pathways in £T. cotf, ft subtilis, and fl. spliaericus. 
The question marks indicate that the patliways for the synthesis of the interme- 
diates pimeloyl-CoA in E. coli and pimelic acid in B. subtilis and B. sphaericus are 
not known. The last reaction is catalyzed by the bioB gene product; the potential 
sulfur donor cysteine (Cys-[SJ) and the additional proteins and cofactors listed 
are based on in vitro studies using coli cell extracts (6. 15. 25, 26). asp, 
aspartate: asn, asparagine: glu, glutamale; ser, serine: L-Ala, alanine; SAM. 
.S-adenosyl-/.-methionine. 



DNA from E. coli was prepared by using purification kits purchased from Qia- 
gen. Inc. 

Cloning of the biotin operon. The positive selection vector pTR264 (31) was 
used to construct a library of ~8- to 10-kb fragments of B. subtilis GP20S DNA 
. in E. coli. Clones with inserts were selected by plating transformants on Luria- 
Bcrtani plates with tetracycline (10 u.g/ml). 

pTR264. prepared in E. coli dam mutant strain GM48 and digested with Bell. 
was ligated with chromosomal DNA from B. subtilis GP20S which had been 
partially digested with Sau3A and fractionated on a sucrose gradient (8- to 12-kb 
fragments). E . coli biotin mutants R879 (bioA), R875 (bioB), R878 (bioC), R877 
(hioD), and R872 (bioE) were each transformed with the ligated DNA by elec- 
troporation, and Bio + colonics were selected on BIOS medium (7). Bio + trans- 
formants that were also Tc r were analyzed for plasmid content. 

Cloning of a B. subtilis fragment containing the 5' end of the bio operon. 
Analyses of restriction maps and Southern blot data using a /wW-containing 
fragment from pBIOlOO as a probe indicated that a 5.5-kb Pstl fragment would 
contain a complete bioA gene and ~2.7 kb of upstream sequences (data not 
shown). A plasmid. pBIOH6. containing this fragment was subsequently recov- 
ered when a mini plasmid library of 4.4- to 6.6-kb Pstl fragments of B. subtilis 
chromosomal DNA was transformed into E. coli BI259 (bioA pcnB) and Bio + 
colonies were selected. pBIOH6 transformed BI259 again to biotin prototrophy 
at a high frequency but did not transform R879 (bioA pcnB*) to either biotin 
prototrophy or ampicillin resistance. 

Only limited quantities of pBIOH6 were recovered from ihtpcnB strain. The 
pcnB80 allele which was used in this cloning experiment is reported to reduce the 
copy number of pBR322 rcplicons to about 6% of wild-type yields (34). To 
improve plasmid yields without impairing plasmid stability, the unique BamlU 
site in the 3' end of bioW was used to subclone a 2.8-kb BamHl-Pstl fragment 



from pBIOH6 into a low-copy-numbcr plasmid, pCLI921 (33). A plasmid, 
pBIO350. that contained the correct 2.8-kb BattiUhPstl fragment was recovered. 
The quantity of pBIO350 recovered from this strain was significantly higher than 
that of pBIOl 16 isolated from the pcnBSO strain. 

Construction of deletions in the biotin operon. The 10-kb EcoKhto-BamUl 
fragment that contained most of the bio operon (except for part of bioW and the 
promoter) was cloned from B. subtilis GP275 (an isogenic strain of GP208) into 
/•:coRI- and /tomHI-digesled pJGP44 to give pBIO201. Several deletion mutants 
and subclones were made from pBIO201 in order to roughly locate the B. subtilis 
bio genes corresponding to the known E. coli bio genes by complementation. 
Deletions were made by cutting with the appropriate restriction enzyme, filling in 
overhangs with Klenow fragment when necessary, and religating. Subclones were 
made into pUC9. 

The 1.5-kb /:coRI-to-QfiI fragment of pBIO20l was removed to give 
pBIO202. the 1.6-kb 7xoRI-lo-.Y/ioI deletion gave pBIO203. the 4.5-kb EcoRU 
lo-Asp 718 deletion gave pBIO204, the 5.2-kb EcoRl-lo-Smal deletion gave 
pBIO205. and the 7-kb deletion from EcoRl to the rightmost EcoRV gave 
pBIO206. The 4.3-kb BamlU-toSmal deletion gave pBIO207, the 3.6-kb insert 
////idlll-to-polylinkcr //mdlll deletion gave pBIO208, and the 3.9-kb Bglll-to- 
Bglll deletion gave pBIO209. The 2.6-kb central Pstl subclone gave pBIO210, the ' 
central 4.1-kb EcoRV subclone (into the Smal site of pUC9) gave pBI02ll, and 
the 3.3-kb EcoRl-to-EcoRV subclone (into the EcoRl-io-Smal backbone of 
pUC9) gave pBI0212. 

Construction of clones of biol and/or or[2. Copies of hiol and orf2 were 
generated by PCR using a Bochringcr Mannheim PCR kit. A llindlll site was 
introduced at the 5' end of each gene, a BamlU site was introduced at the 3' end 
of biol, and an Asp7\Sl site was introduced at the 3' end of orfl. The PCR- 
generated fragments were each cloned into three plasmids with different copy 
numbers, i.e.. the low-copy-number plasmid pCLl921: a medium-copy-number 
plasmid, pJGP44; and the high-copy-number plasmid pUCl9. In two of these 
recombinant plasmids expression of biol and orf2 is under the control of the lac 
promoter (pCL1921 and pUC19). 

DNA sequencing. The B. subtilis bio genes contained on clones pBIOlOO and 
pBIO350 were sequenced by the Sanger didcoxy sequencing method using Sc- 
quenase kits, version 2.0 (United States Biochemicals, Cleveland, Ohio) as in- 
structed by the manufacturer. The strategy used to obtain the DNA sequence of 
the 8- to 10-kb region was to divide the region into four plasmid subclones of 
approximately 2 to 3 kb and then make nested sets of deletions progressing 
through each subclone. To generate (he nested deletions, the exonuclease III- 
cndonuclcase SI method was used; the reagents were purchased as a Gcncrasc 
kit (instructions included; Promcga, Madison, Wis.). Nested deletions were made 
from both ends for three of the subclones and from one end for the fourth. 
Sequencing both sets of nested deletions for three of the subclones gave the 
sequence of both strands of each subclone. For pBIO350. one strand was deter- 
mined similarly and the opposite strand was determined by synthesizing sequenc- 
ing primers at intervals of approximately 150 bp. The junctions between non- 
overlapping subclones were confirmed by sequencing from synthetic primers 
using pBIO201 or pBIOlOO (or subclones thereof) as a template. The sequences 
were aligned and compared with the DNASTAR computer program (DNASTAR, 
Inc.. Madison, Wis.). 

Construction of cat insertions. A cat cassette, encoding chloramphenicol re- 
sistance, derived from pMIllOl (62) was inserted by ligation into the coding 
region of bioW by using a BamHl site: between the BspEl and Pmll sites, deleting 
260 bp oibioB; into biol by using a Smal site; between a pair of Ssll sites, deleting 
457 bp of orf2 plus 149 bp of downstream sequences; into orft by using an Xmnl 
site; into orf6 by using an EcoRV site; and between the pair of BslBl sites, 
deleting orf4. The cat cassette was also used to entirely replace the bio promoter 
region by ligating it between the Ilpal sites. In each of the orf2-Sstl, orf4-BstB\. 
and bioB-BspEl-Pmll constructions, the cat gene was inserted in only one direc- 
tion. In all other constructions, two different plasmid derivatives, in which the cat 
cassette was inserted in either possible orientation, were generated. Each of 
these mutations was then integrated into the bio locus by first linearizing the 
^/-containing plasmid by a restriction enzyme cut outside of the bio DNA; then 
transforming this cut DNA into a competent prototrophic B. subtilis strain, 
PY79; and then selecting for chloramphenicol resistance (Cm r ) at a final chlor- 
amphenicol concentration of 5 u.g/ml. 

Construction of a bioW-lacZ fusion. To construct a bioW-lacZ translational 
fusion, a 3.1-kb BamHl-to-BgM fragment containing most of the coding region 
of E. coli lacZ (amino acid residues 24 to 1021) was ligated into the BamHl site 
of pBIO350. to give pBI0397. The bioW-lacZ fusion was then used in the 
construction of a second plasmid to allow integration of the fusion into the 
modified SPp prophage SP^2Je/2::Tn9i7::pSK10A6 (63). To bring about this 
integration, the following four DNA fragments were ligated together to generate 
plasmid pBIO407: a 6-kb Pstl-lo-Kpnl fragment of pBI0397 containing the 
bioWAacZ fusion, a PCR-gene rated 2-kb Kpn\-Ko-BamYR fragment containing 
the oriC and repA region of pCL192U a PCR-gene rated 1.2-kb Pstll-to-Sall 
fragment containing the cat gene of pC194 (22), and a PCR-generated Safl-to- 
Bglll fragment containing the pUC9 bla gene. pBIO407 contains the bla, lacZ, 
and selectable cat genes in the appropriate orientation to allow integration of the 
bioW-lacZ fusion into the SPpc2<fc/2::TnO/7::pSK10A6 prophage of ZB493 (63). 
A specialized transducing lysate containing SPp: \bioW-lacZ was obtained by heat 
induction at 50°C 
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TABLE 1. Bacterial strains used in cloning, complementation, and analysis of B. subtilis bio genes 



Strain 



Relevant genotype or description 



Source or rcfcrcncc(s) 



B. subtilis 
PY79 
BI421 
JKB3173 
BGSC1A92 
JKB3112 
GP208 
GP275 
ZB493 

£. coli 
YMC9 
DH5ot 
GM48 
R872 
R875 
R877 
R878 
R879 
BM7086 
BI259 



SPp c prototroph 
birA 

bioA173 aroG932 

bioBJ4J aroG932 sacA321 argA2 

biofltt aroG932 

leu amyE bapr A/ipr Aisp-I (Met~) 

leu amyE bapr bnpr (Met") bepr Abpr Cunpr Mipr 

trpC2 pheAl obrB703 SPfk2rfe/2::Tn977::pSK10A6 



MacU169 endAl hsdR17 supE44 tlti-1 

F" (f&0dlacZMM15) MacU169 recAl endAl lisdR17 supE44 thi-1 gyrA relAl 

F" thr leu thi lacY galK galT ara fltyA tsx dam dan supE44 

bioF3 

bioBH 

bioD!9 

bioCJS 

bioA24 

A(mal-bioH) gal 
bioA24 pcnB80 



62 
7 

17, 40 

Bacillus Genetic Stock Center 

17, 40 

49 

50 

63 



Bethesda Research Laboratories 

New England Biolabs 

8 

8 



19 

This study 



Partial diploids were generated by transforming the appropriate Bio + B. sub- 
tilis strain to Cm r with the art-containing transducing phage. These partial dip- 
loids were then grown in Spizizen's minimal salts medium (52) containing 0.4% 
glucose and 0.04% sodium glutamate in the presence or absence of biotin (10 
Hg/liler). Samples were harvested at mid-exponential phase for o-nitrophenyl-0- 
u-galactoside assay (35). 

Nucleotide sequence accession number. The DNA sequence of 10.2 kb includ- 
ing the bio opcron has been submitted to GcnBank under accession number 
U51868. 



RESULTS AND DISCUSSION 

Cloning of the B. subtilis biotin genes. A plasmid library of 
random B. subtilis partial Sau3A fragments (~8 to 12 kb) was 
constructed in E. coli by using the positive selection vector 
pTR264 as described in Materials and Methods. The library 
was used to transform £. coli bio mutants R879 (bioA24\ R875 



(bioB17), R878 (bioC23), R877 (bioD19), R872 (bioF3), and 
BM7086 (bmalA-bioH) (8, 19). Bio + transformants containing 
plasmids that complemented each E. coli bio mutation were 
recovered. Plasmids pBIOlOO and pBIOlOl were isolated by 
complementation in R879 (bioA); plasmids pBIO102 and 
pBIO103 were isolated by complementation in R877 (bioD); 
plasmid pBIO104 was isolated by complementation in R872 
(bioF); plasmids pBIO109 and pBIOllO were isolated by 
complementation in BM7086 (AbioH); and plasmids pBIOlll 
and pBIOH2 were isolated by complementation in R878 
(bioC). Initial restriction analysis of the isolated plasmids in- 
dicated significant overlap of the cloned DNA fragments, sug- 
gesting that the B. subtilis biotin locus contains genes function- 
ally equivalent to the E. coli genes bioA, bioC, bioD, bioF, and 
bioH (Fig. 2). pBIOlOO extended the farthest to the right, 
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FIG. 2. Physical map of the B. subtilis bio operon and flanking DNA. The locations of Ihe structural genes, the putative promoter, the regulatory regions, and the 
transcription termination sites were determined from the nucleotide sequence of the 10.2-kb BstXl-Pstl DNA region. Assignment of the bin genes is described in the 
text. Complc mentation of E. coli bio mutants by plasmids containing cloned fragments of the B. subtilis bio opcron and flanking regions is indicated by plus signs; no 
complementation is indicated by minus signs. Mutations not listed were not tested. Endpoints of DNA segments carried by pBIOlOO, pBIO109, and pBIOH2 arc 
approximate. Symbols: I — I, ORF: |. Bacillus RBS: 9- putative rho-independent transcription termination site: 1, possible start site of transcription for a o*- 
recognized promoter. 
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—300 bp beyond the unique BamHl site at the right end of the 
restriction map of the bio locus shown in Fig. 2. pBIOHO 
extended the farthest to the left, —1,100 bp beyond the EcoRl 
site at the other end of the restriction map (data not shown). 
Southern blots indicated that the insert DNA of pBIOlOO was 
derived from a single continuous segment of the B. subtilis 
chromosome (data not shown). 

Complementation and marker rescue of B. sublilis and E. 
coli bio mutants with plasmids containing B. subtilis bio genes. 
To confirm that the cloned DNA of pBIOlOO contained B. 
subtilis bio genes, pBIOlOO was tested for the ability to marker 
rescue B. subtilis bio mutations (40). The plasmid restored 
biotin prolotrophy to bioA, bioB, and bioF mutants at high 
frequencies, indicating that the cloned DNA contained all or 
part of each of these B. subtilis bio genes. Several of the pBIO 
plasmids were also examined for their ability to complement E. 
coli strains with mutations in bioA, bioB, bioC, bioD, bioF, or 
bioH. Most plasmids complemented more than one E. coli 
biotin mutation (Fig. 2). The isolate pBIOH2 complemented 
E. coli mutations in bioA, bioB, bioC, bioD, bioF, and bioH 
(Fig. 2); however, pBI0112 did not complement the E. coli 
A(gal-uvrB) mutation, which removes the entire E. coli bio 
locus. 

The 9.9-kb EcoKlAo-BamHl fragment containing most of 
the bio locus was cloned into a derivative of pBR322, pJGP44, 
resulting in plasmid pBIO201. To perform complementation 
experiments with plasmids with defined endpoints, a series of 
deletions was generated from pBIO201 as described in Mate- 
rials and Methods. Each deletion-carrying plasmid was intro- 
duced into five E. coli bio mutants (bioA, bioC, bioD, bioF, and 
bioH), and complementation was scored. As shown in Fig. 2, 
the B. subtilis bio genes complementing these E. coli genes 
were located in the 8-kb fragment of DNA from BamHl to 
XhoL The removal of 5.4 kb from the left of the pBIO201 
insert (pBIO205) eliminated the ability to complement bioC 
and bioH mutants. pBIO206 contained only the rightmost 2.5 
kb of the biotin cluster and complemented only bioA and bioF 
mutants. One clone, pBIO208, in which the rightmost 4.0 kb of 
insert DNA was deleted complemented E. coli bioC and bioH 
mutants but failed to complement E. coli bioA, bioD, or bioF 
mutants. These results suggested the gene order (bioC, bioH)- 
bioD-bioF-bioA. 

Cloning of a J?, subtilis fragment containing the 5' end of the 
bio operon. As described below, DNA sequences of the right- 
most end of the cloned insert (pBIOlOO) that extended fur- 
thest to the right revealed about 300 bp of an open reading 
frame (ORF) that was homologous to B. sphaericus bioW, the 
gene encoding pimeloyl-CoA synthase (17, 43), followed im- 
mediately by genes with strong similarity to bioA, bioF, bioD, 
and bioB from E. coli and B. sphaericus (Fig. 2). The 5' end of 
bioW and the promoter of the bio operon were not present on 
any of the originally cloned DNA fragments. Suspecting that it 
might be difficult to clone this region in high-copy-number 
plasmids, we cloned DNA fragments containing bioA and the 
adjacent upstream region by complementation in an E. coli 
strain containing a bioA mutation and a pcnB mutation to 
reduce plasmid copy number (34) as described in Materials 
and Methods. 

Identification and organization of 6/0-specific coding re- 
gions and transcriptional regulatory signals. Analysis of —10 
kb of the DNA sequence from pBIOlOO and pBIO350 indi- 
cated that many or all of the B subtilis biotin biosynthetic 
genes are located in a single operon containing seven coding 
regions (Fig. 2). The bioWgenz appears to be the first gene in 
the operon. Approximately 84 bp upstream from bioW is a 
putative vegetative (a*) promoter sequence (TTGACA — 17 
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B.twbtUh y — TTTCTTCTA TTKACA C A A ACACCACAGAATAA TATAT T- 



CTAATTGTTA A.CTTT C A A TA T A A TTC GTTA A£ A A TTT A C — 3J bp — US — 
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FIG. 3. Comparison of the nucleotide sequences of the B. sphaericus 
hinDAYB regulatory region and the putative H. subtilis hio promoter and regu- 
latory region. The upper sequence represents the putative B. subtilis bio pro- 
moter and regulatory region. The lower sequence represents the B. sphaericus 
bioDAYB regulatory region (17). The sequence shown spans nucleotides 1995 to 
2072 of the nucleotide sequence submitted to GenBank (accession no. U51868 
[see Materials and Methodsl). Symbols: double bold underline. 15-bp putative 
regulatory region of #. sphaericus bioDAYB: dashed lines, regions of dyad sym- 
metry; single bold underlines, -35 and -10 regions of a possible promoter. RBS. 
putative Bacillus RBS. The nucleotides above or below the sequences were 
displaced to facilitate sequence alignment. 



bp — TATATT [36]). This probable promoter sequence was 
followed by a 33-bp segment with strong sequence homology to 
the "regulatory" sites of the B. sphaericus bio operons and 
lesser similarity to the E. coli bio operator site. Comparison of 
the nucleotide sequences of this region with those of the 5' 
noncoding region of the B. sphaericus bioDAYB operon (51) 
revealed two clusters of conserved nucleotides (13 and 11 bp) 
separated by a nonconserved 9-bp segment (Fig. 3). 

The bioW gene (259 amino acids) is followed by ORFs with 
homology to bioA (448 amino acids), bioF (389 amino acids), 
bioD (231 amino acids), and bioB (335 amino acids) (Table 2). 
The next two ORFs, biol (395 amino acids) and orf2 (253 
amino acids), showed no sequence similarity to bioC or bioH or 
to any other known bio gene (Fig. 2 and Table 2). Comparison 
with the protein database of GenBank, however, indicated 
significant similarity of the deduced amino acid sequence of 
biol to those of cytochrome P-450 enzymes from Bacillus mega- 
terium (P^O^^ [21]), Saccharopolyspora erythraea (EryF 
[20] and EryK [53]), and other organisms (53). Cytochrome 
P-450s include monooxygenases known to catalyze hydroxyla- 
tion of many different kinds of substrates, including fatty acids. 
Since synthesis of pimelic acid, a precursor to biotin, might 
involve hydroxylation and/or further oxidation of a fatty acid, 
biol may be involved in an early step in biotin synthesis (see 
below). Although similar protein database searches did not 
reveal a specific function for the orf2 gene product, significant 
similarity between the N-terminal end of the deduced protein 
and putative NAD or NADH binding sites of short-chain al- 
cohol dehydrogenases (e.g., BphB [3, 55]), dehydratases (e.g., 
RfbB [30]), and the p-ketoreductasc domain of EryA,, of 
S. erythraea (11) was detected. Since this region of Orf2 also 
contains a GXGXXG motif, which is characteristic of a FAD 
or NAD binding site (60), it is conceivable that orfl encodes an 
NADH- or NADPH-dependent enzyme. 

Each gene in the bio operon is preceded by a ribosome 
binding site (RBS), with calculated AGs ranging from -10.8 to 
- 18.6 kcal (ca. -45.2 to -77.8 kJ)/mol (Table 2). All genes are 
oriented in the same transcriptional direction (right to left). In 
addition, the 5' ends of bioA, bioF, bioD, and bioB overlapped 
the 3' ends of the genes preceding them, suggesting that ex- 
pression of these genes could be regulated, in part, by trans- 
lational coupling, biol and orf2 are separated from the genes 
that precede them by 68- and 67-bp intercistronic regions, 
respectively. 

orf2 appears to be the last gene in the bio operon, as it is 
immediately followed by a region of dyad symmetry resembling 
a rho-independent transcription termination site (AG = - 15.4 
kcal [ca. -64.4 kJ]/mol). Another stem-ioop structure with 
terminator-like features was detected in the region between 
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TABLE 2. Enzymes, genes, and regulatory elements of the B. subtilis bio operon and flanking DNA. 



% Amino acid identity to corresponding 



Gene 


RBS AG 
(Kcai/mol) 


Predicted 
start codon 


Enzyme or function 


Calculated no. 
of amino acids 


Estimated 
M r 


E. coli b 


gene product from: 
B. sphaerictts? 


Other 


bioW 


-10.8 


ATG 


Pimeloyl-CoA synthase 


259 


29.633 




44 




bioA 


-15.8 


ATG 


DAP A aminotransferase 


448 


50.118 


34 


44 




biof 


-11.6 


TTG 


KAPA synthase 


389 


42.567 


35 


50 




bioD 


-18.6 


TTG 


DTB synthetase 


231 


25,114 


29 


28 




bioB 


-12.2 


ATG 


Biotin synthase 


335 


36,931 


34 


71 


22 rf 


biol 


-18.4 


GTG 


Cytochrome P-450 


395 


44,838 






30/ 33' 


oifl 


-17.6 


GTG 


Unknown 


253 


28,204 








orf3 


-20.0 


GTG 


Unknown membrane-associated 


>258 


>28,600 


53,* 24/' 23' 












transport protein 












or/4 


-10.0 


ATG 


Unknown 


299 


33,780 








orf6 


-17.4 


ATG 


Unknown regulatory protein 


>266 


>29,200 


30/ 26* 







* Calculated according to the method of Tinoco et al. (56). One kilocalorie equals 4.184 kJ. 

* Identity to /:. coli bio gene products (38). \ 
*■' Identity to H. sphaeiicus bin gene products (17). 

w Identity to E. coli lipA product (44). 

e Identity to B. megaterium cytochrome P-450 BM _i (21). 

f Identity to S. erythrttea eryE product (20). 

* Identity to B. subtilis IplC product (18). 
h Identity to coli malG product (10). 

' Identity to E. coli ugpE product (39). 
> Identity to E. coli ehgR product (54). 

* Identity to /,'. coli purli product (45). 



bioB and biol. Several secondary structures of the mRNA are 
possible, with the most favored structure having a AC of for- 
mation of -11 kcal (ca. -46 kJ)/mol and the least favored 
structure having a AG of -5.6 kcal (ca. -23 kJ)/mol. Northern 
(RNA) blots indicated that both terminator-like regions are 
functional: two steady-state transcripts originating near the 
putative P bio promoter were detected, i.e., a 7-kb RNA that 
corresponds to the predicted transcript for the entire seven- 
gene operon and a 5-kb transcript that corresponds to the first 
five genes in the operon (41). The steady-state levels of the 
5-kb transcript were, however, about eightfold greater than the 
levels of the full-length transcript, suggesting that the termi- 
nator-like structure between bioB and biol serves to limit ex- 
pression of biol (41). 

Downstream from the end of the biotin operon, a strong 
RBS (AG = -20.0 kcal [ca. -84 kJ]/mol) and 260 amino acids 
of another coding region, orf3 7 were found. The remainder of 
orf3 continues beyond the BstXl site which marks the end of 
the sequenced region. orf3 is preceded by a sequence, TGAT 
AACGCTTACA, with a perfect match to the consensus se- 
quence TG(T/A)NANCGNTN(A/T)CA for catabolite-con- 
trolled genes in B. subtilis (24, 58). The deduced amino acid 
sequence of orf3 showed significant similarity to a number of 
E. coli membrane-associated transport proteins, e.g., glycerol- 
3-phosphate permease (UgpE [39]) and maltose permease 
(MalG [10]). In particular, the partial Orf3 protein contains a 
20-amino-acid sequence common to all membrane-associated 
transport proteins (10). Significant homology (>50%) of Orf3 
protein to LplC, a transmembrane protein of B. subtilis, was 
also found (18). 

Upstream from the biotin operon is a coding region, orf4, 
preceded by an RBS and a putative promoter (Table 2). 
orf4 is followed by a region of dyad symmetry that resembles a 
rho-independent transcription termination site; this possible 
terminator is approximately 160 bp upstream from the pro- 
posed bioW start codon. Finally, further upstream from orf4, 
oriented in the opposite direction, is an ORF, orf6, extending 
266 codons to the limit of the DNA sequencing. orf6 is pre- 
ceded by an RBS and a potential a A promoter. The deduced 



amino acid sequence of otf6 showed significant similarity to 
those of a number of regulatory proteins of the E. coli LacI 
family, e.g., E. coli EbgR (54) and PurF (a repressor of the 
purine nucleotide biosynlhctic operon) (45). 

The gene-enzyme relationship, the enzyme size, and the 
percent(s) homology to the same enzyme from other organ- 
isms for each bio gene or orf are summarized in Table 2. 

Construction and analysis of a bio-lacZ translational fusion. 
A translational lacZ fusion to bioW was constructed to assess 
the activity and regulation of the putative promoter and regu- 
latory region. This was accomplished by replacing the 3' end 
of the bioW coding sequence with a 3.1-kb BamHl-Bglll frag- 
ment containing a promoterless lacZ coding region in a plas- 
mid designed to allow integration into a modified SPp pro- 
phage (see Materials and Methods). This plasmid, pBIO407, 
contains DNA extending to a position located about 2 kb 
upstream of the presumed bioW start codon and most of the 
bioW coding sequence fused to lacZ on a low-copy-number 
plasmid. pBIO407 turns lacZ E. coli colonies pale blue on X- 
Gal (5-bromo-4-chloro-3-indolyl-p-D-galaclopyranoside) indi- 
cator plates, suggesting that the fusion is expressed at a rela- 
tively low level in E. coll 

To test the expression of the bioW-lacZ fusion, the fusion 
was introduced as a single copy into a B. subtilis protrotroph 
(PY79) and a similar strain (BI421) containing a mutation in 
the unlinked B. subtilis birA gene (7), a gene with similarity to 
the E. coli birA gene whose product serves as both the repres- 
sor for the biotin operon and the ligase that biotinylates acetyl- 
CoA carboxylase (5, 9). SPp specialized transducing phage 
(63) carrying bioW-lacZ was constructed and used to insert the 
fusion into the chromosome of PY79 and BI421 as described in 
Materials and Methods. The resulting Bio* partial diploids 
were grown in the presence or absence of biotin. As judged by 
the levels of P-galactosidase activity, the levels of S?$::bioW- 
lacZ expression were very low, but this expression showed 
biotin-specific regulation (Table 3). p-Galactosidase activity 
was repressed by about 10-fold in the presence of exogenous 
biotin. In a birA mutant strain, constitutive expression of the 
fusion was observed. However, the level of 0-galactosidase 
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TABLE 3. Biotin-regulated expression of S?$::bioW-lacZ 
iranslationa! fusion 



Relevant genotype 


p-Galactosidasc sp act (Miller units)" 
With biotin 6 Without biotin 


bioW-lacZ hio + 
bioW-lacZ bio* birA 


0.05 ± 0.01 0.5 ± 0.06 
0.8 ± 0.15 0.9 ± 0.07 



" Data arc averages ± standard deviations for two isolates and two assays each 
(calculated according to the method of Miller [35]). 
6 Biotin was present at 100 u.g/liter. 



activity in the birA strain was only somewhat higher than the 
levels observed in PY79 containing S?$::bioW-lacZ and grown 
under nonrepressing conditions. Similar results were obtained 
when a bioW-lacZ fusion was introduced by integration of a 
circular plasmid (pBI0397air) by Campbell-like recombina- 
tion at the bio locus (data not shown). These results suggest 
that the B. subtilis bio promoter is regulated by birA and biotin, 
as is the case for the divergent bio promoters of E. coli. In 
future work, it will be interesting to establish whether the 
B. subtilis bio operon is regulated by the B. subtilis BirA by a 
repressor-operator mechanism similar to that used in E. coli 
for the regulation of biotin biosynthesis. 

The B. subtilis biol gene complements both E. coli bioC and 
bioH mutants. The presence of two genes, biol and orfl, with 
homology to neither bioC nor bioH of E. coli, raised the issue 
of which gene(s) was complementing which E. coli mutant. 
Complementation studies using plasmid subclones that con- 
tained either biol or orf2 alone under the transcriptional con- 
trol of the lacZ promoter (see Materials and Methods) indi- 
cated that biol alone was sufficient to complement both E. coli 
BM7086 (bbioH) and E. coli R878 (bioC). Plasmids containing 
orfl did not give normal complementation of either E. coli 
BM7086 or E. coli R878. The cytochrome P-450-like product 
of the biol gene of B. subtilis can apparently supply an activity 
needed for biotin synthesis that can substitute for, or bypass, 
the activity missing in either bioC or bioH mutants of E. coli. 

Insertional mutagenesis of the bio operon and flanking cod- 
ing regions. To verify the boundaries of the bio operon pre- 
dicted from the nucleotide sequence and to confirm the roles 
of previously unidentified bio genes, a cat cassette (chloram- 
phenicol resistance gene) was used to construct insertions or 
deletions in bioW, bioB, biol, orf2, the bio promoter region, 



orf3, orf4, and orf6. First, plasmid derivatives containing these 
mutations were constructed in E. coli, and then the cat inser- 
tions were transferred to the bio locus of B. subtilis by DNA 
transformation (see Materials and Methods). The locations of 
these mutations are diagrammed in Fig. 4. As summarized in 
Table 4, insertions into orf3 and orf6 and deletion of orf4, 
which represent mutation of the coding regions located outside 
of the predicted bio operon, generated Cm r , prototrophic col- 
onies. Insertions and deletions in the bio operon gave results 
that generally supported the conclusions from the nucleotide 
sequence data. Replacement of the region upstream of bioW 
containing the putative P bio promoter with the cat gene ori- 
ented opposite to the biotin operon and interruption of bioW 
with the cat gene oriented in either direction relative to the bio 
operon generated an unambiguous Bio" phenotype. However, 
replacement of the putative P bio promoter region with the cat 
gene inserted in the same transcriptional direction as the biotin 
operon generated Bio~ cells that reverted to Bio + at a high 
frequency (0.1%). Bioassay experiments indicated that biotin 
vitamer production from such a Bio + revertant was increased 
in the presence of low concentrations of chloramphenicol, sug- 
gesting that expression of the biotin operon resulted from 
read-through transcription from the chloramphenicol-induc- 
ible cat promoter. We also observed that the bioB gene was 
expressed when the cat gene was inserted into the biotin 
operon upstream of bioB and oriented in the same transcrip- 
tional direction, as judged by growth of such bacteria on DTB 
(Table 4). 

Deletion (Pmll to BspBl) of the 3' end of bioB also gener- 
ated a Bio" phenotype, confirming that bioB was required for 
biotin biosynthesis. However, the 3' end of the operon could 
not be definitively identified by this genetic method. Insertions 
into biol resulted in Cm r colonies that were partially deficient 
in biotin production, i.e., that grew poorly on biotin-free me- 
dium but grew as well as wild-type colonics in the presence of 
biotin (33 (xg/ml), whereas the orflwcat mutation gave Bio + 
colonies. These results suggested that biol is not absolutely 
required for biotin production and that the orfl gene product 
is dispensable for biotin biosynthesis. The question of whether 
orfl encodes a redundant enzyme that functions in biotin syn- 
thesis or simply an unrelated protein awaits further experimen- 
tation. The bioA gene of E. coli is also located in an operon 
with another ORF {orfl) also of unknown function. However, 



H "* r H 




FIG. 4. Locations of ^/-containing insertions and deletions within the B. subtilis bin operon and flanking DNA As described in Materials and Methods, in vivo 
mutations of the bio genes and flanking open reading frames were generated either by inserting a 1.5-kb car-containing cassette into the indicated restriction site (A, 
Xhuth B, Snial: C. BamlU; or D, A'coRV) or by replacing the indicated region with the cat cassette (E, replacement of a 606-bp Sstl fragment: F, replacement of a 
260-bp Pm!l-BspE\ fragment; G, replacement of three adjoining Ilpal fragments totalling 313 bp; II, replacement of a 966-bp BstBl fragment). Not all restriction sites 
are shown. B. subtilis strains containing these mutations were examined for their biotin phenotypes. and the results are tabulated in Table 4. 
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TABLE 4. Characterization of insertion and deletion 
derivatives of the biotin operon 


Biotin operon derivative 
(mutation) and cat 
gene orientation" 


Biotin 
phenotype*" 


Minimal 
medium** 


Growth on c : 
DTB* 


Pimelic 

acid^ 


Wild-type bio operon 




+ 


+ 


+ 


A (Qorf3) 










R 




+ 






L 


+ 


+ 






b (ntioi) 










R 


+/- 


+/- 


+ 


+ 


L 


+/- 






+ 


C {QbioVV) 










R 


- 


- 


- 


- 


L 


- 




+/- 




D (i 20/70) 










R 


+ 


+ 






L 


+ 


+ 






E (Aorf2) 










R 


+ 


+ 






F (bbioB) 










R 




















R 










L 










II (Aorf4) 










L 


+ 


+ 







* See Fig. 4 for a map of cat insertions within the biotin operon. Insertion 
derivatives having the cat gene in either orientation were obtained: R (right) and 
L (left) identify the transcriptional orientation of the inserted cat gene when the 
bio operon is oriented as shown in Fig. 4. 

b Biotin phcnotypc determined by patching bacteria on biotin-frcc agar plates. 
+. biotin prototroph; +/-, biotin bradytroph; -, biotin auxotroph. 
c +, prototrophic: +/-, bradytrophic: -, auxotrophic. 
d Growth of bacteria on Spizizen's minimal medium agar plates. 

* Growth of bacteria on biotin-free agar plates containing 33 u,g of DTB per 
liter. 

Growth of bacteria on biotin-frcc agar plates containing 33 u.g of pimelic acid 
per ml. 

s Appearance at a frequency of 0.1% of Bio + bacteria in which biotin synthesis 
is inducible by chloramphenicol. 



there is no sequence similarity between the B. suhtilis orf2 gene 
product and the E. coli orfl gene product. 

The biotin bradytroph phenotype generated by the biolv.cat 
mutation appeared to be caused by inactivation of biol rather 
than by a polar effect because strains with mutations disrupting 
the downstream gene orfl or orj'3 were Bio + . To determine 
whether the biol gene product was involved in formation of 
pimelic acid, we examined whether the biohxat mutation could 
be bypassed by feeding pimelic acid. Derivatives of PY79 con- 
taining biolixat with either orientation of the cat gene grew as 
well as wild-type strains on biotin-free medium containing 
pimelic acid (Table 4). These results confirmed that the biol 
gene product is involved early in the biotin pathway. 

E. coli cells expressing the bioWgene of/?, subtilis can utilize 
pimelic acid to synthesize biotin. On the basis of homology 
with the B. sphaericus bioW gene, we hypothesized that the 
B. subtilis bioW gene encodes a pimcloyl-CoA synthase (43). 
To further examine this gene-enzyme relationship, we tested 
whether B. subtilis bioW expression in E. coli could be utilized 
to synthesize biotin from pimelic acid as reported for the 
B. sphaericus bioW (17). First, a fragment containing the B. 
subtilis bioW gene and its promoter was cloned into plasmid 
pCL1921, generating pBIO403. Next, pBIO403 was introduced 
into E. coli UbioH or bioC mutants and the resulting strains 
were tested for complementation. E. coli does not have a bioW 



homolog, and bioC or bioH mutants of E. coli cannot be res- 
cued for growth on biotin-free medium by the addition of 
pimelic acid. However, both kbioH and bioC mutants of E. coli 
containing pBIO403 grew in the absence of biotin when, and 
only when, pimelic acid (30 |xg/ml) was added to the medium. 
This result suggests that bioW encodes a pimeloyl-CoA syn- 
thase that, in the presence of pimelic acid, can bypass bioH and 
bioC in E. coli. 

Early steps in biotin biosynthesis. The early steps in biotin 
biosynthesis appear to be different in the gram-negative bac- 
teria, such as E coli and Serratia marcescens, and the gram- 
positive bacteria, such as B. subtilis and B. sphaericus, two dis- 
tantly related Bacillus species. E. coli cannot use free pimelic 
acid as a precursor for biotin synthesis (14), and 13 C labeling 
experiments indicate that free pimelic acid is not an interme- 
diate in biotin biosynthesis (48). On the other hand, B. subtilis 
and B. sphaericus readily use pimelic acid, which is converted 
to pimeloyl-CoA by pimeloyl-CoA synthase, the product of the 
bioW gene. When supplied with the bioW gene from B. subtilis 
or B. sphaericus (17, 43), E. coli can use pimelic acid to bypass 
the biotin auxotrophy of bioC or bioH mutants. 

Is pimeloyl-CoA synthase an obligatory part of the biotin 
biosynthetic pathway in B. subtilis, or is it part of an alternative 
pimelic acid salvage pathway? While the answer to this ques- 
tion is not clear, preliminary experiments indicate that the 
bio W gene product is required for biotin synthesis in B. subtilis. 
Insertion of the cat gene in place of the promoter region of the 
biotin operon, oriented in the same direction as the bio operon, 
yielded Bio~ colonies that reverted to Bio + at a frequency of 
0.1%. Insertion of the same cat gene in bioW, also oriented in 
the same direction as the bio operon, yielded a nonreverting 
Bio" phenotype. However, such mutants were able to grow 
weakly on DTB or DAP A, indicating that the downstream 
bioB and bioD genes were being expressed. Furthermore, cells 
of B. subtilis containing an in-frame deletion within the chro- 
mosomal bioW gene were also Bio~ but were able to grow well 
on DTB or DAPA (unpublished results). We cannot rule out 
the possibility that both of these bioW mutations exert a polar 
effect on bioF or bioA that is more deleterious than the effect 
on bioD or bioB. However, it appears most likely that the 
pimeloyl-CoA synthase is required for biotin synthesis in B. 
subtilis and that pimelic acid is a bona fide intermediate in 
biotin synthesis in B. subtilis. 

On the basis of the cytochrome P-450-like structure of the 
Biol protein, we hypothesize that B. subtilis synthesizes pimelic 
acid by a pathway different from that of E. coli. Since other 
cytochrome P-450s arc capable of oxidizing unsaturated fatly 
acid (59), we suggest that Biol may function to oxidize the 
double bond of an unsaturated fatty acid. Since Biol will com- 
plement an E. coli bioC or bioH mutant in the absence of 
pimeloyl-CoA synthase, we further speculate that the Biol 
protein can use either a free fatty acid or the CoA thioesler of 
a fatty acid as a substrate to produce pimelic acid or pimeloyl- 
CoA, respectively. 
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We describe the genetic analysis of the bio operon of 
the biotin auxotrophic Bacillus subtilis natto OK2 strain. 
The OK2 strain would oniy cross-feed with the Esche- 
richia coli bioB mutant and also grew well in medium 
containing dethiobiotin. Sequencing analysis revealed 
two significant genetic alterations in the bioW and bioF 
genes within the bio operon of the OK2 strain. 
Complementation analysis with B. subtilis 168 bio 
mutants demonstrated that only the bioB gene could 
complement, but other bio operon genes could not. A 
bio* transformant, isolated from an OK2 strain, has 
biotin autotrophy. 

Key words: Bacillus subtilis nalto: biotin operon; bioB 

The biotin biosynthetic operon in Escherichia coli and 
Bacillus subtilis has been well documented at the 
biochemical and molecular biological levels. M) Analy- 
sis reveals that analogous enzymes from the two species 
are similar, and both operon s contain the bioF gene 
encoding 8-amino-7-ketopelargonic acid synthase, the 
bioA gene encoding diaminoperalgonic acid amino- 
transferase, the bioD gene encoding dethiobiotin syn- 
thetase, and the bioB gene encoding biotin synthetase. 
However, the early steps of the pathway, namely those 
involved in the synthesis of pimeloyl-CoA, are quite 
different. E. coli contains two genes; bioC, which is 
located in the bio operon. and bioH, which is not linked 
to the other bio genes, but the roles of these two genes 
have yet to be identified. On the other hand, B. subtilis 
contains the bioW gene encoding pimeioyl-CoA syn- 
thetase, which is also found in Bacillus sphaericus. 5) and 
the bio/ gene, which shows no homology to either bioC 
or bioH but is able to complement in either bioC or bioH 
of E. coli mutants. 3 * 

B, subtilis nalto is a commercially important micro- 
organism used in the fermentation of soybeans to make 
"natto", a popular food in Japan. Although DNA-DNA 
hybridization reveals that the genomic DNA of B. 
subtilis natto strains is highly homologous to that of B. 



subtilis 168 6) the B. subtilis natto strain requires biotin 
for growth. Here we describe the genetic analysis of the 
bio operon in B. subtilis natto OK2, 7) a highly trans- 
formable strain, and compare it to B. subtilis 168. The 
goal of the study was to construct the hyper biotin 
producer of B. subtilis natto and use it to make "biotin- 
rich natto". 

In order to investigate the biotin biosynthetic pathway 
in the OK2 strain, cross- feeding experiments were done 
as described 8 * with E. coli bio mutants (CGSC Yale 
University), using B. subtilis 168 as a control. Although 
B. subtilis 168 cross-fed the E, coli mutants R872 
(bioFWJ), R879 (bioA24\ R877 (bioDI9), and R875 
(bioB17). an OK2 strain cross-fed only by the R875 
{bioB17) strain (Table 1). Moreover, both strains could 
not cross- feed an E, coli R878 (bioC23) strain. These 
results suggested that an OK2 strain could only convert 
dethiobiotin into biotin during the last step of the biotin 
biosynthetic pathway. 

To analyze the biotin biosynthetic pathway of the 
OK2 strain at the molecular level, the bio operon derived 
from chromosomal DNA of the strain was cloned by 
PCR amplification with primers designed based on the 

Table 1. Cross- feeding Tests between E. coli bio Mutants and 
Bacillus Strains 



£ coli mutants Bacillus strains 



Strain 


Genotype 


B. subtilis 168 


B. subtilis natto OK2 


R878 


bioC23 






R872 


bioFI03 


+ 




R879 


bioA24 


+ 




R877 


bioDl9 


+ 




R875 


bioBi7 


+ 





Five E. coli bio mutants were separately streaked onto biotin-frcc medium in 
agar plates in which washed cells of Bacillus strains were suspended at a 
concentration of 6.0 x 10 s cells/ml. Cross-feeding under these conditions 
resulted from diffusion of biotin precursors excreted by the cells in the agar. 
Cross-feeding interactions were scored after 72 hours. +, growth: — , no 
growth. 



* To whom correspondence should be addressed. Fax: +81-29-888-8525; E-mail: krsy@mx.ibaniki.ac.jp 
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nucleotide sequences of the B. sttbtilis 168 bio operon. 3) 
The bio operon was partially amplified using the 
following pairs of primers: BW-1 (CATCGGCATGTC- 
TATGGGAGG) and BA-2 (TAACCGCTCGTTTAAC- 
CAGG), for bioWand bioA genes; BF-1 (AACAAGC- 
GATCCACGAGGTT) and BD-I (CTCTTCGTCAGT- 
CACTTCTG), for bioF and bioD genes; BB-1 (GAAT- 
CAAGTGGGGGTATGAG) and BI-2 (TTCGGCGG- 
GGCTGACACTTT). for bioB and biol genes, respec- 
tively. Nucleotide sequence analysis shows the bio 
operon to have a similar structure to that of B. subtilis 
168 with both operons arranged on a single operon in the 
order bioWAFDB and followed by two genes, bio! and 
ytbQ, (accession number of this sequence in the DDBJ, 
EMB L, and Gen Bank nucleotide sequence databases. 
AB088066) (Fig. 1). The amino acid sequence homol- 
ogies of these gene products with those of B. subtilis 168 
are extremely high, as follow: BioW, 98.2%; BioA, 
98.2%; BioF, 92.0%; BioD, 97.4%; BioB. 98.8%; Biol, 
97.2%. However, we found two significant differences 
between the bio operon of OK2 and that of B. subtilis 
168. First, a single-base change resulted in the replace- 
ment of Cys (TGC) (strain 168) by a stop codon (TGA) 
(strain OK2) at position 226 in the carboxy-terminus of 
BioW. Secondly, a 54 bp fragment encoding 18 amino 
acids in the bioF gene of B. subtilis 168 was largely 
deleted in positions from 848 to 901 in the OK2 strain. 
These results suggested that the bio operon in the OK2 
strain was genetically defective and therefore showed 
biotin auxotrophy. 
To analyze individual bio genes of the OK2 strain, 



each bio gene was tested for its ability to complement B. 
subtilis 168 bio mutants. Five B. subtilis 168 bio mutants 
(bioW, bio A, bioF, bioD and bioB) were constructed by 
insertional mutagenesis according to the method de- 
scribed previously. 3) Five bio genes of an OK2 strain 
were amplified with pairs of primers (Table 2). The 
amplified DNA product was subsequently digested with 
appropriate restriction enzymes (Table 2) and cloned 
into same restriction sites of the expression pi asm id 
pWH1520 (MoBiTec). Each of the composite plasmids 
was used to transform each of above bio mutants of B. 
subtilis 168, respectively and selected for biotin auxo- 
trophy or prototrophy (data not shown). Only plasmids 
carrying the bioB gene from the OK2 strain comple- 
mented the bioB mutant of B. subtilis 168. The other 
plasmids that carried the bioW, bioA, bioF, and bioD 
genes did not complement and these results were 
identical to those of the cross-feeding tests described 
above. Two genes of bio A and bioD from OK2 were 
highly homologous to those of B. subtilis 168, and seven 
(K39D, D67N, A201E, E205K, M219I, and S430T) and 
four (D28E, N29H, H32D. and RUSH) amino acids 
substitutions were detected in the bio A and bioD genes, 
respectively. These substitutions seem to be essential for 
enzyme activity and further studies are now in progress. 

In addition, we tested whether the OK2 strain could 
be used to synthesize biotin from its precursor dethio- 
biotin. The OK2 strain grew well on biotin-free medium 
containing dethiobiotin as well as medium containing 
biotin (Fig. 2). Moreover, when insertional mutagenesis 
was done on the bioB gene of OK2, 3) this mutant did not 



ioW bio A bioF bioD bioB 



natto OK2 




biol ytbQ 



673 ' 681 

168 GGC TGC CGA 



nattoOK2 GGC |TGAJC G A 
Gly STOP Arg 




Asn He lie Glu Ser Met lie Arg ^ 

844 _ 6 906 

168 AA[C)ATC ATT GAA AGCATGATC0GA 



natto OK2 AA[C]ATC 

844 

Asn lie 



Deletion 



[A)G A 



Arg 



852 



Fig. 1. Structure of the bio Operon. 

The locations of bin structural genes, putative promoter and regulatory regions, and transcription termination sites are shown in the upper 
diagram. Top set of arrows indicates the bio operon of B. subiilis 168 and lower set of arrows indicates B. subtilis natto OK2. Numbers shown 
next to the nucleotide sequences indicate distance (bp) from initiation codon. Black box in bioF indicates a deletion region. Symbols: arrowhead, 
open reading frames: ?. putative transcription termination site; f*\ possible <j a -recognized P bin promoter. 
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bio gene 



Sequence of primer (5' -* 3'; forward and reverse) 



Restriction site a 



bioW TA GGTAC C TAAC AATTTAGGTG AGAAG 

-57 -39 
TT AGATCT GGTAAATGGCAGCCAGAGG 
714 723 
bioA AT GGATC C TAAGATGTAAAC ACGTAC ATAC 

-75 -46 
CT GCATGCA TTGACCGCAGGTTACGATG 
1294 1313 
bioF AG GGATCC TGAAGAGCTCTCGGAAATG 
-59 -41 
GA GC ATGC GATATAAC CGTTTTC C CTAC 
1103 1132 
bioD CGGTTAACCATAGTATGGGTGATATTG 
-65 -49 
GAGTCGACCTCATACCCCCACTTGATTC 
678 698 
bioB AT AC TAGT TG ATG AATC A AGTGGGGG 

-25 -8 
TAGGATCCCTTTCAGCTTTTCGCAC 
995 1012 



Kpn\ 
BxIU 
Kpn\ 
Sph\ 
Kpn\ 
Sphi 
Hpa\ 
Sail 
Spel 
BamHl 



a Thc restriction site for cloning has been underlined in ihc sequence. 

Numbers shown on the primer sequence (bold type) are indicated in terms of the distance (bp) from the initiation codon. 



10 




Incubation time (hrs) 

Fig. 2. Growth of bioB* and bioBr Strains of B. suhtilis Natto OK2. 
B. suhtilis natto OK2 was grown aerobically at 37 C C in minimal 
medium (I4g of K 2 HP0 4 , 6g of KH>P0 4 , 1.9g of sodium citrate. 
2g of (NH 4 ) 2 SO ; . 1.4 g of MgS(V7H>0. 5 g of glucose and I liter 
of deionized water) with or without biotin (O.lng/ml) and 
dethiobiotin (0.1 ng/ml), respectively. Cell growth was monitored 
by measuring the optical density at 660 nm. Symbols; O. bioB* 
strain with biotin: ■. bioB* strain with dethiobiotin: bioB~ 
strain with dethiobiotin: □. bioB* strain without biotin. 



grow on medium containing dethiobiotin (Fig. 2). These 
results confirm that the bioB gene product in the OK2 
strain is indeed involved in the last step of the biotin 
biosynthetic pathway. 

The results of this study using B. subtilis are similar to 
those of Hatakeyama et al. who used biotin-requiring 
coryneform bacteria. 8) By using cross-feeding studies 
with E. coli bio mutants, they demonstrated that coryne- 
form bacteria lack of the enzymes involved in the early 
steps of the pathway, encoded by the bioF, bioC, and 
bioH genes. Taken together, the above results indicate 
that biotin auxotrophic microorganisms lack the func- 
tional genes involved in the early steps of the biotin 
biosynthetic pathway. 

To confirm whether biotin auxotrophy is due to a 
defect of the above genes (bioW, bioF, bio A, and bioD 
genes) in the bio operon of the OK2 strain, we attempted 
to repair those genes by homologous recombination with 
the whole bio operon of B. subtilis 168. Strain OK2 was 
transformed with amplified DNA containing the bio 
operon by using the primers BW-1 and BI-2 and bio + 
transformants were obtained on biotin-free medium. The 
nucleotide sequence analysis of five bio + transformants 
confirmed that all of the transformants contained the 
substituted bio operon (data not shown). To evaluate the 
biotin prototrophy. we examined the growth of these 
transformants on biotin-free medium. Although the bio + 
transformants grew well in both biotin-free medium and 
biotin-containing medium, the growth rate of this strain 
decreased gradually over repeated cultivations in spite 
of no alteration in the nucleotide sequences of the bio 
operon (data not shown). These results suggest that 
although bio + transformants have biotin autotrophy, 
they are unstable genetically. Although B. subrilis 168 
synthesizes pimelic acid as a true intermediate in the 
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early steps of biotin biosynthesis, 3 * the precursor of 
pimclic acid is still unknown as well as in the case of B. 
splutricus. 51 Therefore, we conclude that the early steps 
to produce pimelic acid are genetically unstable in the 
OK2 strain as compared with the case of B. subtilis 168. 
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